Scalable Boolean Tensor Factorizations using Random Walks

نویسندگان

  • Dóra Erdös
  • Pauli Miettinen
چکیده

Tensors are becoming increasingly common in data mining, and consequently, tensor factorizations are becoming more and more important tools for data miners. When the data is binary, it is natural to ask if we can factorize it into binary factors while simultaneously making sure that the reconstructed tensor is still binary. Such factorizations, called Boolean tensor factorizations, can provide improved interpretability and find Boolean structure that is hard to express using normal factorizations. Unfortunately the algorithms for computing Boolean tensor factorizations do not usually scale well. In this paper we present a novel algorithm for finding Boolean CP and Tucker decompositions of large and sparse binary tensors. In our experimental evaluation we show that our algorithm can handle large tensors and accurately reconstructs the latent Boolean

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Boolean Tensor Decomposition based on the Walk'n'Merge Algorithm

Tensor decomposition is a long standing data mining technique with numerous applications in different fields. It has been used in image processing, signal processing, computer vision, social network analysis and many more. It’s a useful tool for understanding high dimensional data, like classic matrix factorization (e.g. SVD) for two dimensional data. This work focuses on the decomposition of 3...

متن کامل

Quantum random walk search on satisfiability problems

Using numerical simulation, we measured the performance of several potential quantum algorithms, based on quantum random walks, to solve Boolean satisfiability (SAT) problems. We develop the fundamentals of quantum computing and the theory of classical algorithms to indicate how these algorithms could be implemented. We also discuss the development of quantum random walks and the principles tha...

متن کامل

Coalescent Random Walks on Graphs

Inspired by coalescent theory in biology, we introduce a stochastic model called ”multi-person simple random walks” or “coalescent random walks” on a graph G. There are any finite number of persons distributed randomly at the vertices of G. In each step of this discrete time Markov chain, we randomly pick up a person and move it to a random adjacent vertex. To study this model, we introduce the...

متن کامل

Random Walks in Weyl Chambers and the Decomposition of Tensor Powers

We consider a class of random walks on a lattice, introduced by Gessel and Zeilberger, for which the reflection principle can be used to count the number of K-step walks between two points which stay within a chamber of a Weyl group. We prove three independent results about such "reflectable walks": first, a classification of all such walks; second, many determinant formulas for walk numbers an...

متن کامل

Scalable Tensor Factorizations with Missing Data

The problem of missing data is ubiquitous in domains such as biomedical signal processing, network traffic analysis, bibliometrics, social network analysis, chemometrics, computer vision, and communication networks—all domains in which data collection is subject to occasional errors. Moreover, these data sets can be quite large and have more than two axes of variation, e.g., sender, receiver, t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1310.4843  شماره 

صفحات  -

تاریخ انتشار 2013